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Jfc (vi) CURRENT APPLICATION DATA: 

J, (A) APPLICATION NUMBER: 

(B) FILING DATE: 
W (C) CLASSIFICATION: 

^0 (viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Presser, Leopold 

(B) REGISTRATION NUMBER: 19,827 
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(2) INFORMATION FOB SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 2002.. 3081 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GCTAGCCACC AGTGACGATG CCTTGAATTT GGCCATTCTG ACCCAGGCCC GTATTCTGAA 
grCCCGCATT CGCATTGTTA ATCGTTTGTT CAACCATGCC CTGGGTAAAC GTTTAGACAC 
I.CCTTGCCA GACCACGTTA GTTTGAGTGT TTCCGCCCTG GCGGCCCCGA TTTTTTCCTT 

Igcggctttg ggcaatcagg cgatcgggca attgcgtttg tttgaccaga cttggcccat 

IcAGGAAATT GTCATTCACC AAGACCATCC CTGGCTCAAT TTACCCCTGG CGGATTTATG 

Sgatgatccg agccgaatgt tgatctatta cctaccggcc cacagtgaaa cggatttagt 
Iggcgcagtg gtgaataatt taacgttgca atctggggac catttaatag tgggacaaaa 
^CCCCAACCC AAGACCAAAC ggcgatcgcc ttggcgcaaa ttttccaaac tgattaccaa 

IcTGCGGGAG TATCAGCGGT ATGTCCAACA GGTGATATGG GTGGTGTTGT TTTTATTGTT 
% GATGATTTTT CTGGCCACCT TCATCTACGT TTCCATTGAT CAACATATTG CCCCAGTGGA 
CGCGTTGTAT TTTTCCGTGG GCATGATTAC CGGGGCCGGT GGCAAGGAAG AGGTGGCCGA 
AAAGTCCCCC GATATCATCA AAGTATTCAC AGTGGTGATG ATGATCGCCG GGGCGGGGGT 
GATTGGTATT TGTTATGCCC TACTGAATGA TTTCATCCTT GGCAGTCGCT TTAGTCAGTT 
TTTGGATGCG GCCAAGTTAC CCGATCGCCA TCACATCATC ATTTGTGGGC TGGGGGGAGT 
GAGCATGGCC ATTATTGAAG AGTTAATTCA CCAGGGCCAT GAAATTGTGG TAATCGAAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



55 
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GGATACAGAT AATCGTTTCT TGCATACGGC CCGCTCCCTG GGGGTGCCCG TAATTGTGGA 
GGATGCCCGC CTAGAAAGAA CGTTGGCCTG CGCCAATATC AACCGAGCCG AAGCCATTGT 
GGTGGCCACC AGCGACGACA CCGTTAACTT GGAAATTGGC CTAACTGCCA AGGCGATCGC 
CCCTAGCCTG CCAGTGGTGT TGCGTTGCCA GGATGCCCAG TTTAGCCTGT CCCTGCAGGA 
AGTATTTGAA TTTGAAACGG TGCTTTGTCC GGCGGAATTG GCCACCTATT CCTTTGCGGC 
OOCGGCCCTG GGGGGCAAAA TTTTGGGCAA CGGCATGACC GATGATTTGC TGTGGGTAGC 
CCTAGCCACC TTAATCACTC CTAACCATCC CTTTGCCGAC CAATTGGTTA AAATTGCAGC 
CCAAAAGTCT GATTTCGTTC CCCTCTATCT AGAACGGGGT GGCAAAACCA TCCATAGCTG 
S3AATTATTG GGTACCCATC TCGACTCTGG AGACGTGTTG TATTTAACCA TGCCCGCCAC 
Igccctagag CAACTTTGGC GATCGCCCCG TGCCACTGCT GATCCTCTGG ACTCTTTTTT 
Sgtttagcat GGGGGGATGG aactcttgac TCGGCCCAAT GGTGATCAAG AAAGAACGCT 
Itgtctatgt ttagtatttt taagttaacc AACAGCAGAG gataacttcc aaaagaaatt 
Sagctcaaaa agtagcaaaa taagtttaat tcataactga gttttactgc taaacagcgg 
Sgcaaaaaag tcagataaaa taaaagcttc acttcggttt tatattgtga ccatggttcc 
'Iaggcatctg ctctagggag tttttccgct gcctttagag agtattttct ccaagtcggc 
|aactccccc atttttaggc aaaatcatat acagactatc ccaatattgc cagagctttg 
"Itgactcact gtagaaggca gactaaaatt ctagcaatgg actcccagtt ggaataaatt 

TTTAGTCTCC CCCGGCGCTG GAGTTTTTTT GTAGTTAATG GCGGTATAAT GTGAAAGTTT 
TTTATCTATT TAAATTTATA A CTA ACA GCG GAA AGA „ AAA XTT ACC 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2031 

2079 



-56- 



sss:nssssssssssss 

45 50 

S S S S S5 S £ SS SS 2S S S SS S K S2 
S S S S Leu Zl S S S IS ESS S SS 2S S 

80 8b 
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Src ACC TAC GAT TTT GTC GGG TTA TCT AGT TTT CTT TGG CGC TAT CGC 
ft £yr Asp III Val Gly Leu Ser Ser Phe Leu Trp Arg Tyr Arg 



110 



PAC AAC TAT TTG CAC CAC ACC TAC ACC AAT ATT CTT GGC CAT GAC GTG 
fis aIh £yr Leu His His Thr Tyr Thr Asn He Leu Gly Hxs Asp Val 

IaA ATC CAT GGA GAT GGC GCA GTA CGT ATG AGT CCT GAA CAA GAA CAT 

^ lie His Gly Asp Gly Ala Val Arg Met Ser Pro Glu Gin Glu Hia 

m 140 145 

^TT GGT ATT TAT CGT TTC CAG CAA TTT TAT ATT TGG GGT TTA TAT CTT 

Val S£ Ue £yr Arg Phe Gin Gin Phe Tyr lie Trp Gly Leu Tyr Leu 
155 16° 165 

TTC ATT CCC TTT TAT TGG TTT CTC TAC GAT GTC TAC CTA GTG CTT AAT 
Phe iTl Pro ITe Tyr Trp Phe Leu Tyr Asp Val Tyr Leu Val Leu Asn 



175 



AAA GGC AAA TAT CAC GAC CAT AAA ATT CCT CCT TTC CAG CCC CTA GAA 
Lys Gly Lys Tyr 35 Asp His Lys He Pro Pro Phe Gin Pro Leu Glu 
190 195 



TTA GCT AGT TTG CTA GGG ATT AAG CTA TTA TGG CTC GGC TAC GTT TTC 
Leu Sa Ser ilu Leu Gly He Lys Leu Leu Trp Leu Gly Tyr Val Phe 
205 210 215 

HOC TTA CCT CTG GCT CTG GGC TTT TCC ATT CCT GAA GTA TTA ATT GGT 
IZu Pro LeS Ala Leu Gly Phe Ser He Pro Glu Val Leu He Gly 



220 225 

ATG ACC 
Met Thr _„ _ 
235 240 245 



OCT TCG GTA ACC TAT ATG ACC TAT GGC ATC GTG GTT TGC ACC ATC TTT 
£a sir wll tS ?yr Met Thr Tyr Gly He Val Val Cys Thr He Phe 



2175 



2319 



2367 



2415 



2559 



2703 



2751 



ATP PTG GCC CAT GTG TTG GAA TCA ACT GAA TTT CTC ACC CCC GAT GGT 2799 
Me? SS His vll Leu Olu Ser Thr Glu Phe Leu Thr Pro Asp Gly 



255 260 265 

™* T^r OCT GCC ATT GAT GAC GAG TGG GCT ATT TGC CAA ATT CGT ACC 

Glu SSr A^a lie Asp Asp Glu Trp Ala He Cys Gin lie Arg Thr 

270 275 

ACC GCC AAT TTT GCC ACC AAT AAT CCC TTT TGG AAC TGG TTT TGT GGC 

Thr Sa j£n Phe Sa Thr Asa Asn Pro Phe Trp Asn Trp Phe Cys Gly 

285 290 

GGT TTA AAT CAC CAA GTT ACC CAC CAT CTT TTC CCC AAT ATT TGT CAT 2943 

cty Leu £n Sis Gin Val Thr His His Leu Phe Pro Asn He Cys His 
Y 300 305 310 



2847 



2895 



2991 



ATT CAC TAT CCC CAA TTG GAA AAT ATT ATT AAG GAT GTT TGC CAA GAG 
Se His Tyr Pro Gin Leu Glu Asn He He Lys Asp Val Cys Gin Glu 
320 325 JJU 

|tT GGT GTG GAA TAT AAA GTT TAT CCC ACC TTC AAA GCG GCG ATC GCC 3039 
%l Gly Val Glu Tyr Lys Val Tyr Pro Thr Phe Lys Ala Ala He Ala 
* 335 340 



ibT AAC TAT CGC TGG CTA GAG GCC ATG GGC AAA GCA TCG TGACATTGCC 3088 
fer Asn Tyr Arg Trp Leu Glu Ala Met Gly Lys Ala Ser 
r 35Q 355 360 



|jrGGGATTGA AGCAAAATGG CAAAATCCCT CGTAAATCTA TGATCGAAGC CTTTCTGTTG 3148 

fcCGCCGACC AAATCCCCGA TGCTGACCAA AGGTTGATGT TGGCATTGCT CCAAACCCAC 3208 

^TTGAGGGGG TTCATTGGCC GCAGTTTCAA GCTGACCTAG GAGGCAAAGA TTGGGTGATT 3268 

TTGCTCAAAT CCGCTGGGAT ATTGAAAGGC TTCACCACCT TTGGTTTCTA CCCTGCTCAA 332 8 

TGGGAAGGAC AAACCGTCAG AATTGTTTAT TCTGGTGACA CCATCACCGA CCCATCCATG 3388 

TGGTCTAACC CAGCCCTGGC CAAGGCTTGG ACCAAGGCCA TGCAAATTCT CCACGAGGCT 3448 

AGGCCAGAAA AATTATATTG GCTCCTGATT TCTTCCGGCT ATCGCACCTA CCGATTTTTG 3508 

AGCATTTTTG CCAAGGAATT .CTATCCCCAC TATCTCCATC CCACTCCCCC GCCTGTACAA 3568 
AATTTTATCC ATCAGCTAGC 



3588 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Thr Ala Glu Arg He Lys Phe Thr Gin Lys Arg Gly Phe Arg 

1 5 10 A:> 

Arg Val Leu Asn Gin Arg Val Asp Ala Tyr Phe Ala Glu His Gly Leu 



20 



Thr Gin Arg Asp Asn Pro Ser Met Tyr Leu Lys Thr Leu He He Val 
35 40 

Leu Trp Leu Phe Ser Ala Trp Ala Phe Val Leu Phe Ala Pro Val He 

50 55 60 

Phe Pro Val Arg Leu Leu Gly Cys Met Val Leu Ala He Ala Leu Ala 
35 70 75 

«U Phe Ser Phe Asn Val Gly His Asp Ala Asn His Asn Ala Tyr Ser 
85 90 

100 



fly Leu Ser Ser Phe Leu Trp Arg Tyr Arg His Asn Tyr Leu His His 
& y 120 125 



fer Asn Pro His He Asn Arg Val Leu Gly Met Thr Tyr Asp Phe Val 
100 105 110 

Ser Phe Leu Trp Arg 

fihr Tyr Thr Asn He Leu Gly His Asp Val Glu He His Gly Asp Gly 

^ 130 135 140 

'JO 

$la Val Arg Met Ser Pro Glu Gin Glu His Val Gly He Tyr Arg Phe 
1)45 150 155 1^0 

Gin Gin Phe Tyr lie Trp Gly Leu Tyr Leu Phe He Pro Phe Tyr Trp 
■*" i /- r- i in 1 /d 



165 



Phe Leu Tyr Asp Val Tyr Leu Val Leu Asn Lys Gly Lys Tyr His Asp 
180 inq 190 



185 1 9 0 



His Lys He Pro Pro Phe Gin Pro Leu Glu Leu Ala Ser Leu Leu Gly 
195 200 205 

He Lys Leu Leu Trp Leu Gly Tyr Val Phe Gly Leu Pro Leu Ala Leu 
210 215 220 

Gly Phe Ser He Pro Glu Val Leu He Gly Ala Ser Val Thr Tyr Met 
225 230 235 240 

Thr Tyr Gly He Val Val Cys Thr He Phe Met Leu Ala His Val Leu 



245 



Glu Ser Thr Glu Phe Leu Thr Pro Asp Gly Glu Ser Gly Ala lie Asp 
260 265 270 



5? 



Asp 


Glu 


Trp 
275 


Ala 


He 


Cys 


Gin 


He 
280 


Asn 


Asn 
290 


Pro 


Phe 


Trp 


Asn 


Trp 
295 


Phe 


Thr 
305 


His 


His 


Leu 


Phe 


Pro 
310 


Asn 


He 


Glu 


Asn 


He 


He 


Lys 
325 


Asp 


Val 


Cys 


Val 


Tyr 


Pro 


Thr 
340 


Phe 


Lys 


Ala 


Ala 


Glu 


Ala 


Met 


Gly 


Lys 


Ala 


Ser 
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Arg Thr Thr Ala Asn Phe Ala Thr 
285 

Cys Gly Gly Leu Asn His Gin Val 
300 

Cys His He His Tyr Pro Gin Leu 
315 320 

Gin Glu Phe Gly Val Glu Tyr Lys 
330 335 

He Ala Ser Asn Tyr Arg Trp Leu 
345 350 

Gly Lys Ala Ser 

355 

2 

3) INFORMATION FOR SEQ ID NO: 3: 

% <i) SEQUENCE CHARACTERISTICS: 

ft (A) LENGTH: 1884 base pairs 

? *j (b) TYPE: nucleic acid 

^ (C) STRANDEDNESS : both 

JL (D) TOPOLOGY: linear 

*3 (ii) MOLECULE TYPE: DNA (genomic) 

lx i) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

^GCTTCACTT CGGTTTTATA TTGTGACCAT GGTTCCCAGG CATCTGCTCT AGGGAGTTTT 60 
TCCGCTGCCT TTAGAGAGTA TTTTCTCCAA GTCGGCTAAC TCCCCCATTT TTAGGCAAAA 120 
TCATATACAG ACTATCCCAA TATTGCCAGA GCTTTGATGA CTCACTGTAG AAGGCAGACT 180 
AAAATTCTAG CAATGGACTC CCAGTTGGAA TAAATTTTTA GTCTCCCCCG GCGCTGGAGT 240 
TTTTTTGTAG TTAATGGCGG TATAATGTGA AAGTTTTTTA TCTATTTAAA TTTATAAATG 300 
CTAACAGCGG AAAGAATTAA ATTTACCCAG AAACGGGGGT TTCGTCGGGT ACTAAACCAA 360 
CGGGTGGATG CCTACTTTGC CGAGCATGGC CTGACCCAAA GGGATAATCC CTCCATGTAT 420 
CTGAAAACCC TGATTATTGT GCTCTGGTTG TTTTCCGCTT GGGCCTTTGT GCTTTTTGCT 480 
CCAGTTATTT TTCCGGTGCG CCTACTGGGT TGTATGGTTT TGGCGATCGC CTTGGCGGCC 
TTTTCCTTCA ATGTCGGCCA CGATGCCAAC CACAATGCCT ATTCCTCCAA TCCCCACATC 
AACCGGGTTC TGGGCATGAC CTACGATTTT GTCGGGTTAT CTAGTTTTCT TTGGCGCTAT 
CGCCACAACT ATTTGCACCA CACCTACACC AATATTCTTG GCCATGACGT GGAAATCCAT 



540 
600 
660 
720 
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GGAGATGGCG 


CAGTACGTAT 


GAGTCCTGAA 


CAAGAACATG 


TTGGTATTTA 


TCGTTTCCAG 


780 


CAATTTTATA 


TTTGGGGTTT 


ATATCTTTTC 


ATTCCCTTTT 


ATTGGTTTCT 


CTACGATGTC 


840 


TACCTAGTGC 


TTAATAAAGG 


CAAATATCAC 


GACCATAAAA 


TTCCTCCTTT 


CCAGCCCCTA 


900 


GAATTAGCTA 


GTTTGCTAGG 


GATTAAGCTA 


TTATGGCTCG 


GCTACGTTTT 


CGGCTTACCT 


960 


CTGGCTCTGG 


GCTTTTCCAT 


TCCTGAAGTA 


TTAATTGGTG 


CTTCGGTAAC 


CTATATGACC 


1020 


TATGGCATCG 


TGGTTTGCAC 


CATCTTTATG 


CTGGCCCATG 


TGTTGGAATC 


AACTGAATTT 


1080 


CTCACCCCCG 


ATGGTGAATC 


CGGTGCCATT 


GATGACGAGT 


GGGCTATTTG 


CCAAATTCGT 


1140 


ACCACGGCCA 


ATTTTGCCAC 


CAATAATCCC 


TTTTGGAACT 


GGTTTTGTGG 


CGGTTTAAAT 


1200 


CACCAAGTTA 


CCCACCATCT 


TTTCCCCAAT 


ATTTGTCATA 


TTCACTATCC 


CCAATTGGAA 


1260 


o 

^ATATTATTA 


AGGATGTTTG 


CCAAGAGTTT 


GGTGTGGAAT 


ATAAAGTTTA 


TCCCACCTTC 


1320 


JJJaAGCGGCGA 


TCGCCTCTAA 


CTATCGCTGG 


CTAGAGGCCA 


TGGGCAAAGC 


ATCGTGACAT 


1380 


SfGCCTTGGGA 


TTGAAGCAAA 


ATGGCAAAAT 


CCCTCGTAAA 


TCTATGATCG 


AAGCCTTTCT 


1440 


iGTTGCCCGCC 


GACCAAATCC 


CCGATGCTGA 


CCAAAGGTTG 


ATGTTGGCAT 


TGCTCCAAAC 


1500 


JTCACTTTGAG 


GGGGTTCATT 


GGCCGCAGTT 


TCAAGCTGAC 


CTAGGAGGCA 


AAGATTGGGT 


1560 


^GATTTTGCTC 

l_i 

# , CAATGGGAA 


AAATCC GCTG 


GGATATTGAA 


AGGCTTCACC 


ACCTTTGGTT 


TCTACCCTGC 


1620 


GGACAAACCG 


TCAGAATTGT 


TTATTCTGGT 


GACACCATCA 


CCGACCCATC 


1680 


U S 

^CATGTGGTCT 


AACCCAGCCC 


TGGCCAAGGC 


TTGGACCAAG 


GCCATGCAAA 


TTCTCCACGA 


1740 


GGCTAGGCCA 


GAAAAATTAT 


ATTGGCTCCT 


GATTTCTTCC 


GGCTATCGCA 


CCTACCGATT 


1800 


TTTGAGCATT 


TTTGCCAAGG 


AATTCTATCC 


CCACTATCTC 


CATCCCACTC 


CCCCGCCTGT 


1860 


ACAAAATTTT 


ATCCATCAGC 


TAGC 








1884 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
AATATCTGCC TACCCTCCCA AAGAGAGTAG TCATTTTTCA TCAATGGCTG CTCAAATCAA 
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GAAATACATT 
GATTCAAGGG 
TCCCTTGAAG 
CTCTACATGG 
TTCTGAGGTT 
TGACAAAAAA 
GAGTGTTTAT 
GATGGGGTTT 
^GTGTCTGAT 
JpATAAGTATT 
^GAATATGAC 
pTCACTCACC 
#GTAAGTTAT 
CjGTACAATCT 
p=€TTGGGATGC 
^GGTGAAAGA 
6TTCTCCTTG 
GTTTGAGAAA 
TCATGGTGGA 
CCTTAGGAAA 
TTATGCATCT 
GCAGGCTAGG 
TCATGGTTAA 
GTGTCTTGTC 
ATGTTTTTTA 
TGCATATTGT 
TGTTTTCAGT 
TATTT 



ACCTCAGATG 
AAAGCCTATG 
AGTCTTGCTG 
AAGAATCTTG 
TCTAAAGATT 
GGTCATATTA 
GGGGTTTTGT 
CTTTGGATTC 
TCAAGGCTTA 
GGTTGGTGGA 
CCTGATTTAC 
TCTCATTTCT 
CAACATTGGA 
CTCATAATGT 
CTAGTGTTCT 
ATTATGTTTG 
AACCACTTCT 
CAAACGGATG 
TTGCAATTCC 
ATCTCGCCCT 
TTCTCCAAGG 
GATATAACCA 
AATTACCCTT 
TTGGTTCTAC 
ATATATTTTA 
CAATTGTTGT 
TGAAGCTCAT 



AACTCAAGAA 
ATGTTTCGGA 
GTCAAGAGGT 
ATAAGTTTTT 
ATAGGAAGCT 
TGTTTGCAAC 
TTTGTGAGGG 
AGAGTGGTTG 
ATAAGTTTAT 
AATGGAACCA 
AATATATACC 
ATGAGAAAAG 
CATTTTACCC 
TGTTGACCAA 
CGATTTGGTA 
TTATTGCAAG 
CTTCAAGTGT 
GGACACTTGA 
AAATTGAGCA 
ACGTGATCGA 
CCAATGAAAT 
AGCCGCTCCC 
AGTTCATGTA 
TTGTTGGAGT 
GAGGTTTTGC 
GCTCAATATC 
GTGTACTTCT 



C C AC GAT AAA 
TTGGGTGAAA 
AACTGATGCA 
CACTGGGTAT 
TGTGTTTGAG 
TTTGTGCTTT 
TGTTTTGGTA 
GATTGGACAT 
GGGTATTTTT 
TAATGCACAT 
ATTCCTTGTT 
GTTGACTTTT 
TATTATGTGT 
GAGAAATGTG 
CCCGTTGCTT 
TTTATCAGTG 
TTATGTTGGA 
CATTTCTTGT 
TCATTTGTTT 
GTTATGCAAG 
GACACTCAGA 
GAAGAATTTG 
ATAATTTGAG 
CATTGCAACT 
TTTCATCTCC 
TGATATTTTG 
ATAGACTTTG 



CCCGGAGATC 
GACCATCCAG 
TTTGTTGCAT 
TATCTTAAAG 
TTTTCTAAAA 
ATAGCAATGC 
CATTTGTTTT 
GATGCTGGGC 
GCTGCAAATT 
CACATTGCCT 
GTGTCTTCCA 
GACTCTTTAT 
GCTGCTAGGC 
TCCTATCGAG 
GTTTCTTGTT 
ACTGGAATGC 
AAGCCTAAAG 
CCTCCTTGGA 
CCCAAGATGC 
AAACATAATT 
ACATTGAGGA 
GTATGGGAAG 
ATTATGTATC 
TGTCTTTTAT 
ATTATTGATG 
GAATGTACTT 
TTTAAATGGT 



TATGGATCTC 
GTGGCAGCTT 
TCCATCCTGC 
ATTACTCTGT 
TGGGTTTGTA 
TGTTTGCTAT 
CTGGGTGTTT 
ATTATATGGT 
GTCTTTCAGG 
GTAATAGCCT 
AGTTTTTTGG 
CAAGATTCTT 
TCAATATGTA 
CTCAGGAACT 
TGCCTAATTG 
AACAAGTTCA 
GGAATAATTG 
TGGATTGGTT 
CTAGATGCAA 
TGCCTTACAA 
ACACAGCATT 
CTCTTCACAC 
TCCTATGTTT 
GGTTTATTAG 
AATAAGGAGT 
TGTACCACTG 
TATGTCATGT 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1685 



# # 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 448 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ala Ala Gin lie Lys Lys Tyr He Thr Ser Asp Glu Leu Lys Asn 
His Asp Lys Pro Gly Asp Leu Trp lie Ser He Gin Gly Lys Ala Tyr 



20 



Asp val ser Asp Trp val Lys Asp His Pro Gly Gly Ser Phe Pro Leu 

35 40 
Lys ser Leu Ala Gly Gin Glu val Thr Asp Ala Phe Val Ala Phe His 

50 55 
Pro Ala ser Thr Trp Lys Asn Leu Asp Lys Phe Phe Thr Gly Tyr Tyr 
65 70 " 

Leu Lys Asp Tyr Ser Val Ser Glu Val Ser Lys Asp Tyr Arg Lys Leu 

85 * u 
val Phe Glu Phe Ser Lys Met Gly Leu Tyr *sp Lys Lys Gly His He 

100 10S 
Met Phe Ala Thr Leu Cys Phe lie Ala Met Leu Phe Ala Met Ser Val 

115 120 
Tyr Gly Val Leu Phe Cys Glu Gly Val Leu Val His Leu Phe Ser Gly 

130 135 
Cys Leu Met Gly Phe Leu Trp He Gin Ser Gly Trp He Gly His Asp 
145 150 

Ala Gly His Tyr Met Val Val Ser Asp Ser Arg Leu Asn Lys Phe Met 
Gly lie Phe Ala Ala Asn Cys Leu Ser Gly He Ser He Gly Trp Trp 



180 



Lys Trp Asn His Asn Ala His His He Ala Cys Asn Ser Leu Glu Tyr 

Asp Pro Asp Leu Gin Tyr He Pro Phe Leu Val Val Ser Ser Lys Phe 
210 215 220 



/ ^7 



245 



Phe Gly Ser Leu Thr Ser His Phe Tyr Glu Lys Arg Leu Thr Phe Asp 
225 230 

Ser Leu Ser Arg Phe Phe Val Ser Tyr Gin His Trp Thr Phe Tyr Pro 

Tyr Val Gin Ser 

_ Arg Ala Gin Glu 
275 " 280 285 



He Met Cys Ala Ala Arg Leu Asn Met Tyr Val Gin Ser Leu He Met 
260 265 z/u 

Leu Leu Thr Lys Arg Asn Val Ser Tyr Arg Ala Gin Glu Leu Leu Gly 
275 280 285 

Cys Leu Val Phe Ser He Trp Tyr Pro Leu Leu Val Ser Cys Leu Pro 
290 295 300 

Asn Trp Gly Glu Arg He Met Phe Val He Ala Ser Leu Ser Val Thr 
3 305 310 315 320 

I Gly Met Gin Gin Val Gin Phe Ser Leu Asn His Phe Ser Ser Ser Val 
j 325 330 3Jb 

: Tyr Val Gly Lys Pro Lys Gly Asn Asn Trp Phe Glu Lys Gin Thr Asp 

* 340 345 -"U 

* Gly Thr Leu Asp He Ser Cys Pro Pro Trp Met Asp Trp Phe His Gly 

355 360 365 

5 Gly Ser Gin Phe Gin He Glu His His Leu Phe Pro Lys Met Pro Arg 

i- 370 375 380 

j cys Asn Leu Arg Lys He Ser Pro Tyr Val lie Glu Leu Cys Lys Lys 

| 385 390 395 400 

His Asn Leu Pro Tyr Asn Tyr Ala Ser Phe Ser Lys Ala Asn Glu Met 
405 410 415 

Thr Leu Arg Thr Leu Arg Asn Thr Ala Leu Gin Ala Arg Asp He Thr 
420 425 430 

Lys Pro Leu Pro Lys Asn Leu Val Trp Glu Ala Leu His Thr His Gly 
435 440 445 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
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Trp He Gly His Asp Ala Gly His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Asn Val Gly His Asp Ala Asn His 

O 1 5 

CO 

#2) INFORMATION FOR SEQ ID NO: 8: 

2 (i) SEQUENCE CHARACTERISTICS: 

In (A) LENGTH: 8 amino acids 

^ (B) TYPE: amino acid 

^ (D) TOPOLOGY: linear 

Jg (ii) MOLECULE TYPE: Peptide 

?J (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

Val Leu Gly His Asp Cys Gly His 

M 1 5 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (peptide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 

Val He Ala His Glu Cys Gly His 
1 5 



